70 research outputs found

    FunSimMat update: new features for exploring functional similarity

    Get PDF
    Quantifying the functional similarity of genes and their products based on Gene Ontology annotation is an important tool for diverse applications like the analysis of gene expression data, the prediction and validation of protein functions and interactions, and the prioritization of disease genes. The Functional Similarity Matrix (FunSimMat, http://www.funsimmat.de) is a comprehensive database providing various precomputed functional similarity values for proteins in UniProtKB and for protein families in Pfam and SMART. With this update, we significantly increase the coverage of FunSimMat by adding data from the Gene Ontology Annotation project as well as new functional similarity measures. The applicability of the database is greatly extended by the implementation of a new Gene Ontology-based method for disease gene prioritization. Two new visualization tools allow an interactive analysis of the functional relationships between proteins or protein families. This is enhanced further by the introduction of an automatically derived hierarchy of annotation classes. Additional changes include a revised user front-end and a new RESTlike interface for improving the user-friendliness and online accessibility of FunSimMat

    Ontology-based similarity measures and their application in bioinformatics

    Get PDF
    Genome-wide sequencing projects of many different organisms produce large numbers of sequences that are functionally characterized using experimental and bioinformatics methods. Following the development of the first bio-ontologies, knowledge of the functions of genes and proteins is increasingly made available in a standardized format. This allows for devising approaches that directly exploit functional information using semantic and functional similarity measures. This thesis addresses different aspects of the development and application of such similarity measures. First, we analyze semantic and functional similarity measures and apply them for investigating the functional space in different taxa. Second, a new software program and a new database are described, which overcome limitations of existing tools and simplify the utilization of similarity measures for different applications. Third, we delineate two applications of our functional similarity measures. We utilize them for analyzing domain and protein interaction datasets and derive thresholds for grouping predicted domain interactions into low- and high-confidence subsets. We also present the new MedSim method for prioritization of candidate disease genes, which is based on the observation that genes and proteins contributing to similar diseases are functionally related. We demonstrate that the MedSim method performs at least as well as more complex state-of-the-art methods and significantly outperforms current methods that also utilize functional annotation.Die Sequenzierung der kompletten Genome vieler verschiedener Organismen liefert eine große Anzahl an Sequenzen, die mit Hilfe experimenteller und bioinformatischer Methoden funktionell charakterisiert werden. Nach der Entwicklung der ersten Bio-Ontologien wird das Wissen über die Funktionen von Genen und Proteinen zunehmend in einem standardisierten Format zur Verfügung gestellt. Dadurch wird die Entwicklung von Verfahren ermöglicht, die funktionelle Informationen direkt mit Hilfe semantischer und funktioneller Ähnlichkeit verwenden. Diese Doktorarbeit befasst sich mit verschiedenen Aspekten der Entwicklung und Anwendung solcher Ähnlichkeitsmaße. Zuerst analysieren wir semantische und funktionelle Ähnlichkeitsmaße und verwenden sie für eine Analyse des funktionellen Raumes verschiedener Organismengruppen. Danach beschreiben wir eine neue Software und eine neue Datenbank, die Limitationen existierender Programme überwinden und den Einsatz von Ähnlichkeitsmaßen in verschiedenen Anwendungen vereinfachen. Drittens schildern wir zwei Anwendungen unserer funktionellen Ähnlichkeitsmaße. Wir verwenden sie zur Analyse von Domän- und Proteininteraktionsdatensätzen und leiten Grenzwerte ab, um die Domäninteraktionen in Teilmengen mit niedriger und hoher Konfidenz einzuteilen. Außerdem präsentieren wir die MedSim-Methode zur Priorisierung von potentiellen Krankheitsgenen. Sie beruht auf der Beobachtung, dass Gene und Proteine, die zu ähnlichen Krankheiten beitragen, funktionell verwandt sind. Wir zeigen, dass die MedSim-Methode mindestens so gut funktioniert wie komplexere moderne Methoden und die Leistung anderer aktueller Methoden signifikant übertrifft, die auch funktionelle Annotationen verwenden

    GOTax: investigating biological processes and biochemical activities along the taxonomic tree

    Get PDF
    GOTax, a novel web-based platform that integrates protein annotation with protein family classification and taxonomy, allows for an extensive assessment of functional similarity between proteins and for comparing and analyzing the distribution of protein families and protein functions over different taxonomic groups

    Applications of semantic similarity measures

    Get PDF
    There has been much interest in uncovering protein-protein interactions and their underlying domain-domain interactions. Many experimental techniques have been developed, for example yeast-two-hybrid screening and tandem affinity purification. Since it is time consuming and expensive to perform exhaustive experimental screens, in silico methods are used for predicting interactions. However, all experimental and computational methods have considerable false positive and false negative rates. Therefore, it is necessary to validate experimentally determined and predicted interactions. One possibility for the validation of interactions is the comparison of the functions of the proteins or domains. Gene Ontology (GO) is widely accepted as a standard vocabulary for functional terms, and is used for annotating proteins and protein families with biological processes and their molecular functions. This annotation can be used for a functional comparison of interacting proteins or domains using semantic similarity measures. Another application of semantic similarity measures is the prioritization of disease genes. It is know that functionally similar proteins are often involved in the same or similar diseases. Therefore, functional similarity is used for predicting disease associations of proteins. In the first part of my talk, I will introduce some semantic and functional similarity measures that can be used for comparison of GO terms and proteins or protein families. Then, I will show their application for determining a confidence threshold for domain-domain interaction predictions. Additionally, I will present FunSimMat (http://www.funsimmat.de/), a comprehensive resource of functional similarity values available on the web. In the last part, I will introduce the problem of comparing diseases, and a first attempt to apply functional similarity measures based on GO to this problem

    Molecular basis of telaprevir resistance due to V36 and T54 mutations in the NS3-4A protease of the hepatitis C virus

    Get PDF
    Structural analysis of the inhibitor Telaprevir (VX-950) of the hepatitis C virus (HCV) protease NS3-4A shows that mutations at V36 and/or T54 result in impaired interaction with VX-950, explaining the development of viral breakthrough variants

    Functional inhibition of acid sphingomyelinase by Fluphenazine triggers hypoxia-specific tumor cell death

    Get PDF
    Owing to lagging or insufficient neo-angiogenesis, hypoxia is a feature of most solid tumors. Hypoxic tumor regions contribute to resistance against antiproliferative chemotherapeutics, radiotherapy and immunotherapy. Targeting cells in hypoxic tumor areas is therefore an important strategy for cancer treatment. Most approaches for targeting hypoxic cells focus on the inhibition of hypoxia adaption pathways but only a limited number of compounds with the potential to specifically target hypoxic tumor regions have been identified. By using tumor spheroids in hypoxic conditions as screening system, we identified a set of compounds, including the phenothiazine antipsychotic Fluphenazine, as hits with novel mode of action. Fluphenazine functionally inhibits acid sphingomyelinase and causes cellular sphingomyelin accumulation, which induces cancer cell death specifically in hypoxic tumor spheroids. Moreover, we found that functional inhibition of acid sphingomyelinase leads to overactivation of hypoxia stress-response pathways and that hypoxia-specific cell death is mediated by the stress-responsive transcription factor ATF4. Taken together, the here presented data suggest a novel, yet unexplored mechanism in which induction of sphingolipid stress leads to the overactivation of hypoxia stress-response pathways and thereby promotes their pro-apoptotic tumor-suppressor functions to specifically kill cells in hypoxic tumor areas

    Improving disease gene prioritization using the semantic similarity of Gene Ontology terms

    Get PDF
    Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have to be prioritized and validated further. Recent studies discovered that genes involved in phenotypically similar diseases are often functionally related on the molecular level
    corecore